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ABSTRACT 

A  Pascal  program  to  compute  McCabe's  control  flow  complexity  metric  for 
FORTRAN  modules  is  presented.  McCabe's  metric  which  is  called  the  cyclomatic 
number  gives  the  size  of  any  basis  set  of  control  flow  paths  through  a  program 
module.  McCabe's  metric  is  a  useful  indicator  of  the  level  of  difficulty 
required  to  test  and  maintain  a  program  module. 
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This  memorandum  was  prepared  under  Job  Order  No.  771VOO,  Special  Projects 
and  Studies.  The  authors  are  located  at  the  Naval  Underwater  Systems  Center, 
New  London,  Connecticut,  06320. 
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INTRODUCTION 

McCabe's  cyclomatic  number  (denoted  v(G))  is  a  control  flow  complexity 
measure  derived  from  graph  theory  by  Thomas  J.  McCabe  [1].  It  measures  the 
size  of  the  smallest  set  of  paths  which  will  generate  every  possible  oath 
through  a  program  module.  This  set  may  be  thought  of  as  a  basis  set  (of 
control  flow  paths)  for  the  module. 

The  cyclomatic  number  has  been  proposed  for  several  uses,  among  them 
determining  the  minimum  number  of  test  cases  required  to  test  every  statement 
in  a  program  and  as  a  method  of  controlling  module  size.  McCabe  suqqests  that 
a  v ( G )  greater  than  ten  may  lead  to  testinq  and  maintenance  problems. 

The  proqram  presented  to  compute  McCabe's  cyclomatic  number  is  written  in 
Pascal  and  computes  the  cyclomatic  number  as  a  function  of  the  decision  nodes 
in  a  proqram  module.  A  user  settable  switch  is  provided  to  allow  the  optional 
viewinq  of  the  decision  nodes  as  a  module  analysis  aid.  The  program  will 
handle  any  number  of  FORTRAN  modules  (main  programs,  subroutines,  functions) 
in  the  file  it  examines  as  long  as  each  has  it's  own  "END"  statement. 


BACKGROUND 

McCabe's  cyclomatic  number  is  a  proqram  flow  complexity  metric  taken  from 
Graph  theory  by  defininq  a  mapping  between  a  proqram  module  and  a  directed 
graph  classically  called  the  proqram  control  graph.  Blocks  of  sequential  code 
are  mapped  into  the  nodes,  and  transfers  of  control  between  blocks  into  the 
arcs  of  the  qraph. 

The  cyclomatic  number  ( v ( G) )  of  any  graph  G  with  e  edqes,  n  nodes  and  p 
connected  components  (in  this  case,  the  number  of  connected  components  can  be 
assumed  to  be  one)  is  defined  to  be: 

v(G)  =  e  -  n  +  2d  (1) 

The  cyclomatic  number  is  the  number  of  linearly  independent  paths  throuqh 
the  qraph,  which  when  taken  in  combination  generate  all  possible  paths  throuqh 
the  qraph.  So,  by  the  mappinq,  the  maximum  number  of  linearly  independent 
oaths  throuqh  the  program  module  is  given  by  the  cyclomatic  number.  In  his 
paper,  McCabe  shows  that  v(G)  may  also  be  computed  by: 

v(G)  =  d  +  1  (2) 

where  d  is  the  sum  of  one  less  than  the  number  of  outgoing  arcs  from  each 
decision  node.  (A  decision  node  is  a  node  that  has  two  or  more  outgoing  arcs.) 
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Figure  1 

FUNCTION  SIMPSN(FUNC.XMIN.XMAX.N) 

C 

C  .THIS  FUNCTION  INTEGRATES  A  FUNCTION  BY  SIMPSON'S 

C  ...RULE;  FUNC  IS  THE  NAME  FOR  THE  DUMMY  FUNCTION  TO 

C  ...  BE  INTEGRATED 

C 

H=(XMAX-XMIN)/N 
SUM=0.0 
X=XMIN+H 
DO  10  I -2 , N 

IF  (MOD(I ,2) ,EQ.O)THEN 
SUM=SUM+4. *FUNC(X) 

ELSE 

SUM=SUM*2 . *FUNC(X) 

END  IF 
X  =  X+H 

10  CONTINUE 

SIMPSN=H/3. «(FUNC(XMIN)+SUM+FUNC(XMAX) ) 

RETURN 

END 


In  figure  1  a  simple  FORTRAN  module  is  presented.  In  figure  2  this 
module  is  broken  into  blocks  that  represent  nodes  in  graphical  form.  Figure  3 
shows  the  equivalent  program  flow  graph. 

Figure  2 


FUNCTION  SIMPSN(FUNC ,xmin ,XMAX,N) 

c 

c  ...THIS  FUNCTION  INTEGRATES  A  FUNCTION  BY  SIMPSON'S 

C  ...RULE;  FUNC  IS  THE  NAME  FOR  THE  DUMMY  FUNCTION  TO 

C  . . .BE  INTEGRATED 

C 

C - | -node  A  - 

C  V 

H- ( XMAX-XMI N ) /N 
SUM=0 . 0 
X=XMIN-"H 

C - |-node  B  -  (  I  oop  I  dec  i  s  i  on )  )  — 

C  V 

00  10  1  =  2, N 

C - | -node  C  -(decision) - 

C  V 

IF  ( MOD (1,2), EQ . 0 ) THEN 

C - |-node  D - 

C  V 

SUM=SUM+4 . «FUNC(X) 

C - |-node  E  - 

C  V 

ELSE 

SUM=SUM+2 . *  FUNC (  X  ) 

END  IF 

C - |-node  F  - 

C  V 

X  =  X  +  H 

10  CONTINUE 

C - i-node  G - 

C  V 

SIMPSN=H/3 . * (FUNC(XMIN)»SUM»FUNC(XMAX) ) 

RETURN 

END 

c - 
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Figure  3 


A.  S 
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By  counting  the  number  of  nodes  and  edges  and  applying  (1),  or 
equivalently  counting  the  number  of  decision  nodes  and  applying  (2),  the 
cyclomatic  number  of  the  FORTRAN  program  module  may  be  obtained.  In  the 
example  above: 

v  ( G )  =  e-  n  +  2p  =  8-  7  +  2  =  3 
or 

v  ( G)  =  d  +  1  =  2  +  1  =  3 

In  particular,  each  of  the  following  sets  of  paths  is  a  basis  for  the 
graph  in  figure  3: 

|B  C  D  F  B,  8  C  E  F  B,  A  B  G|  (3) 

{A  8  C  0  F  B  G,  A  B  C  E  F  B  G,  A  B  G}  (4) 

where: 

BCDF8=ABCDF8G-ABG 
BCEF8  =  A8CEFBG-ABG 
Some  properties  of  v(G)  of  interest  follow: 

1.  v(G)  depends  only  on  the  decision  structure  of  G. 

2.  Addinq  or  deleting  function  nodes  in  the  module  does  not  change  v(G). 

3.  The  total  cyclomatic  number  of  a  proqram  is  equal  to  the  sum  of  the 
cyclomatic  numbers  of  the  modules. 
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APPLICATIONS 

The  cyclomatic  number  bounds  the  minimum  number  of  test  cases  that 
exercise  all  the  statements  in  a  program  module.  Since  a  program  may  have  an 
infinite  number  of  paths  through  it  (an  example  is  DO  WHILE  (Niagara  Falls)), 
it  is  not  always  possible  to  check  every  path.  However,  it  is  possible  to 
check  every  statement.  Since  in  the  traversal  of  a  maximal  set  of  linearly 
independent  paths  every  node  of  the  program  control  graph  is  visited,  the 
cyclomatic  number  is  an  upper  bound  on  the  minimum  number  of  tests  required  to 
visit  all  statements  in  a  program  module.  For  example,  we  draw  the  reader's 
attention  to  the  basis  set  (4)  for  Figure  3,  where  only  two  of  the  three  paths 
have  to  be  traversed  to  visit  every  statement  in  the  program.  On  the  other 
hand,  traversing  all  paths  in  a  basis  set  guarantees  that  every  arc  in  the 
program  control  graph  is  also  traversed.  Thus,  if  one  can  identify  a  basis  of 
v(G)  paths  (like  (4))  that  traverse  the  program  control  graph  from  entry  node 
to  exit  node,  and  select  a  corresponding  set  of  v(G)  tests  that  visit  these 
paths,  then  one  can  exercise  every  transfer  of  control  between  program  blocks, 
as  well  as  every  statement  in  the  program. 

It  should  be  pointed  out  that  having  each  and  every  transfer  of  control 
traversed  at  least  once  is  more  stringent  than  having  each  and  every  statement 
executed  at  least  once.  This  is  so  because  visiting  each  and  every  transfer 
of  control  implies  that  every  statement  in  the  module  is  executed,  but  not 
conversely.  This  is  illustrated  in  Figure  4,  where  traversal  of  the  path  PQR 
executes  every  statement  in  the  program  but  fails  to  execute  the  branch  (P,R). 

F i gure  4 
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However,  it  should  be  emphasized  that  the  Pascal  program  described  in  the 
next  two  sections  computes  only  the  cyclomatic  number.  Developing  a  program 
that  identifies  a  basis  set  of  paths  is  a  matter  that  requires  further  study. 

In  any  event,  the  cyclomatic  number  can  be  used  to  improve  a  program  module's 
testability  -  if  a  module's  v(G)  is  greater  than  some  threshold  value,  then  it 
should  be  redesigned  to  drive  its  v { G)  below  the  threshold,  so  that  a  smaller 
basis  set  of  tests  will  visit  every  statement  in  the  program.  Although  McCabe 
suggests  a  threshold  value  of  10,  this  value  is  by  no  means  sacrosanct. 

ALGORITHM 

The  Pascal  program  computes  the  cyclomatic  number  by  operating  on  all  the 
decision  nodes  in  a  file  until  it  encounters  a  FORTRAN  "END"  statement.  Each 
decision  node  is  converted  into  a  positive  integer  that  is  accumulated  in  a 
running  sum.  When  an  "END"  statement  is  detected,  the  number  one  is  added  to 
the  sum  and  then  the  result  is  displayed  as  the  cyclomatic  complexity  count 
for  the  module.'  The  running  sum  is  then  set  to  zero  and  the  process  repeated 
until  the  end  of  the  file  is  reached. 

Specific  decision  nodes  are  found  by  ignoring  FORTRAN  comments  and 
building  complete  FORTRAN  statements  from  the  remaining  source.  When  a 
FORTRAN  statement  is  completed,  it  is  examined  for  keywords.  If  the  statement 
is  identified  as  a  decision  node,  then  either  the  number  one  or  one  less  than 
the  number  of  exits  from  the  decision  node  is  added  to  the  running  sum. 
Otherwise,  the  statement  is  discarded  and  the  next  statement  is  constructed 
and  examined.  If  an  "END"  statement  is  detected,  the  cyclomatic  number  is 
computed  and  displayed.  When  an  end  of  file  condition  is  detected,  the 
program  ends. 

Because  the  program  examines  statements  in  this  manner,  there  is  no  limit 

on  the  number  of  lines  in  the  FORTRAN  source  file  or  on  the  number  of  modules 

in  the  file.  The  maximum  number  of  decision  nodes  that  it  will  count  in  any 

one  module  is  limited  only  by  the  respective  (UNIVAC  or  VAX)  Pascal’s  maximum 

integer  (MAXINT).  The  program  accepts  ANSI-77  FORTRAN  and  recognizes  the 
decision  nodes  shown  in  Figure  5. 

The  Pascal  program  will  not  process  FORTRAN  modules  that  contain  tab 
characters.  Tab  characters  may  be  removed  by  using  the  CLEAN  utility  program 
[4]. 
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Figure  5 

The  Pascal  program  recognizes  the  following  FORTRAN  control 
structures  and  assigns  each  a  complexity  count  of  1: 

1 )  (  DO  I  oops  } 

DO  <label>  <integer  variable-*  =  < f  i  e I d* , <f i e 1 d* 

2  )  {  Block  DO  loops  } 

DO  <integer  variable*  =  <f i e 1 d* , <f i e I d* 


END  DO 

3)  (  DO  WHILE  loops  ) 

DO  < I abe I >  WHILE  ('Boolean  expression*) 

4)  {  Block  DO  WHILE  luops  ) 

DO  WHILE  (<DOOlean  expression*) 


END  DO 

Ol  (  Logical  IF  ) 

IF  I 'Ooalean  expression*)  ^FORTRAN  key  word* 

b)  {  Logical  IF  in  ELSE  clause  5 

ELSE  IF  (<ooolean  expression*)  <FORTRAN  key  word* 

The  Pascal  program  assigns  the  complexity  numoer  of  one  less 
than  the  number  of  unique  statement  labels  in  the  associated 
label  list  of  each  of  the  following  control  structures: 

7 )  {  Ar i thmet i c  IF  ) 

IF(<variaDle*)<l abe I  1  * , < 1 abe )2*,<label3> 

8)  (  Arithmetic  IF  in  ELSE  clause  } 

ELSE  IF(<variable*)<label 1>,<label2*,<label3* 

9J  {  Computed  GOTO  ] 

GOTO(<label 1>, . . . , < 1 abe 1 n* ) .'integer  variable* 

10)  (  Computed  GOTO  in  ELSE  clause  ) 

ELSE  G0T0( < 1 abe II 1 abe 1 n> ).< i nt eger  variable* 

The  Pascal  program  allows  Computed  GOTO's  and  Arithmetic  IF 
statements  to  follow  Logical  IP’s  (or  ELSE  IF's)  as  per  ANSI_77f5J 

11)  (  Arithmetic  IF  following  Logical  IF  ) 

[ELSE|  IF(<doolean  expression*)  I F ( <var i ab I e* ) 

»  < 1 abe I  1  * , < 1 abe I  2* , < 1 abe 1 3* 

12)  (  Computed  GOTO  following  Logical  IF  ) 

IELSE]  IF(<Ooolean  expression*)  GOTO ( < I abe I  1  * , . . , <  1  abe I n> )  , 

♦  <integer  variable* 
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USING  THE  PROGRAM 

In  the  two  examples  below,  everything  that  the  user  would  type  is 
under! ined. 

I.  Usinq  the  Proqram  on  the  VAX  (V701,  V 703.  V70A) 

To  compute  the  c.yclomatic  complexity  of  the  program  in  fiqure  1  on  the 
VAX  the  followinq  OCL  may  be  used,  where  CYCL TEST. FOR  is  the  file  name  of  the 
proqram  in  fiqure  1. 

$  ASSIGN/USER  MODE  CYCLETEST.FOR  INFILE 
$  RUN  lMjG.CL£Arl  JCYCL 

print  out  decision  nodes  ?  <y,n>:  Y 

DO  10  1=2, N 

IF  (M0D( 1,2) .EQ. 01  THEN 

McCabe's  cyclomatic  number  for  module  1  is  3. 

<ENDFILE> 

The  utility  is  not  case  sensitive.  If  the  Tab-Formattinq  feature  available 
with  the  VAX  FORTRAN  has  been  used,  the  CLEAN  utility  will  have  to  be  used  before 
invoking  the  cyclomatic  number  program  [41. 

II.  Using  the  Proqram  on  the  UN  I  VAC: 

‘  * 

On  the  UNI  VAC,  the  cyclomatic  number  proqram  may  he  found  in 
CYCL*PAS.UTIL  on  node  2  in  New  London.  For  the  followinq  samole  runstream  an 
imaqinarv  FORTRAN  source  element  of  FOO*BAR.CYCLTEST  which  holds  the  prnqram 
qiven  in  Fiqure  1  will  be  used. 

>4>ASG.A  F00*BAR. 

>READY 

>aASG, T  INFILE. 

>READY 

>a)ED  FOO*BAR.CYCLTEST.  INFILE. 

»DXQT  CYCL*PAS.UTIL 
> 

>  print  out  decision  nodes  ?  <.y,n>:  Y 

> 

>  DO  10  1=2. N 

IF  ( MOD ( 1 , 2 ) . E Q . 0 ) THE N 

> 

>  McCabe's  cyclomatic  number  for  module  1  is  3. 

> 

>  <ENDFILE> 

> 
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CONCLUSION 

A  Pascal  program  is  presented  which  computes  McCabe's  cyclomatic  number, 
the  cardinality  of  any  basis  set  of  control  flow  paths  through  a  program 
module.  Since  all  paths  through  the  module  may  be  generated  from  a  basis  set, 
the  cyclomatic  number  bounds  the  minimum  number  of  tests  needed  to  exercise 
every  transfer  of  control  between  program  blocks  in  the  program  module,  as 
well  as  every  program  statement.  Therefore,  the  Pascal  program  may  be  used  to 
earmark  program  modules  for  redesign  that  require  a  large  number  of  tests  to 
validate  the  correctness  of  every  transfer  of  control.  The  redesign  effort 
should  strive  to  reduce  the  cyclomatic  number  of  any  program  module  to  some 
acceptable  threshold  value  in  order  to  improve  the  module's  maintainability 
with  respect  to  it's  testability. 
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